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Abstract 

The relative computational effort among the spatially five-point numerical flux functions of 
Harten, van Leer, and Osher and Chakravarthy is explored. These three methods typify the 
design principles most often used in constructing higher than first-order upwind total variation 
diminishing (TVD) schemes. For the scalar case the difference in operation count between any 
two algorithms may be very small and yet the operation count for their system counterparts 
might be vastly different. The situation occurs even though one starts with two different yet 
equivalent representations for the scalar case. 


I. Introduction 

Various high-resolution upwind difference schemes have been developed in recent years [1- 
9]. Of special interest are the extension of these methods to systems of nonlinear hyperbolic 
conservation laws in one and higher dimensions. Typical methods for systems are MUSCL 
[1,2], Osher [6], Harten [3], Roe [5], and van Leer [1]. Here van Leer method means using 
his methodology [1] to extend the first-order Roe’s scheme [5] to higher than first-order and 
MUSCL means van Leer’s pioneer work on extending the first-order Godunov scheme to higher- 
order. The MUSCL and Osher type of approach require considerably more computation and 
programming effort than with the latter three, especially for problems in multidimensions and in 
curvilinear coordinates. Harten and Roe’s second-order schemes share many common features. 
Considering only the latter three, the way second-order accuracy is achieved and the form of 
limiters play an important role. For the scalar case, the difference in operation count between 
any two algorithms may be very small and yet the operation count for their system counterparts 
might be vastly different. This situation occurrs even though one starts with two different yet 
equivalent representations for the scalar case. An example will be given in section V. 

Recently, Osher and Chakravarthy [10] extended their earlier work on five-point fully upwind 
second-order total variation diminishing (TVD) schemes to five-point second-order or third-order 
upwind biased TVD schemes. They also derived formulas for more than five points and higher 
than third-order cases. Only the second and third-order are discussed here. They claimed that 
their five-point second-order or third-order upwind biased scheme requires hardly anymore work 
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than their five-point fully upwind second-order scheme. It was the investigation of their work 
which prompted the work of this note. As one can see later for system cases, their five-point 
upwind biased schemes require more operations than their fully upwind second-order scheme. 
Furthermore, their five-point fully upwind or upwind biased schemes require substantially more 
operations than the Harten or van Leer method. For the purpose of this paper, order of accuracy 
of a TVD scheme refers to the numerical solution that is away from extreme points. 

Specifically, it is the objective of this note to discuss the relative computational effort among 
the spatially five-point TVD schemes of Harten, van Leer and Osher and Chakravarthy [3,7,10] 
for system cases via the Huang [11] and Roe’s [12] generalization for systems. It is emphasized 
here that, the scalar schemes of Harten, van Leer and Osher and Chakravarthy are used. Then 
extensions to systems are employed by the same method (i.e. Huang or Roe), not by their 
original Riemann solver [1,13] for system cases. Huang and Roe’s generalization to system cases 
is based on a local linearization which defines at each point a “local” system of characteristic 
fields, and then applies the scalar scheme to each of the scalar characteristic equations. Because 
of the freezing of coefficients, one can transform the local characteristic field back to the original 
field variables, and arrive at an algorithm for the nonlinear hyperbolic system case. Numerical 
experiments with these schemes on one and two-dimensional steady-state calculations [14-20] 
show that they produce similar shock resolution. In certain instances, the third-order scheme 
gives a slightly better result than the second-order method [20], In general, the improvement 
from second-order to third-order is far less pronounced than from first-order to second-order 
TVD schemes [19], This is due partly to the fact that all TVD schemes of higher than first- 
order reduce to first-order at points of extrema. Knowing their performance, it is relevant to 
compare their computational effort. 

The comparison is only on the numerical flux function. No specific time discretization is 
involved. For simplicity in presentation, the forward Euler time differencing is assumed. One 
should notice that under the forward Euler time differencing only Harten’s method is second- 
order in time and space; the other two are first-order in time and second-order in space. As 
a side remark, TVD schemes are not necessarily upwind [21-23], but upwind TVD schemes, in 
general, give sharper shocks than symmetric TVD schemes. The advantage of symmetric TVD 
schemes is that they are easier to implement and provide a more natural way of extending the 
scheme to two and three-dimensional problems. One can easily modify Harten’s method to be 
non-upwind. In this note, only the aforementioned three upwind methods are considered. 


II. Harten TVD Scheme 

Consider a scalar hyperbolic conservation law 


fto , d/(«) 

dt dx 


= 0 , 


( 2 . 1 ) 


where / is the flux function and a(u) = df/du is the characteristic speed. Let u” be the 
numerical solution of (2.1) at x — jAx and t = nAt, with Ax the spatial mesh size and At the 
time step. Consider a general five-point explicit difference scheme in conservation form 



„“+' = „»-A(k“ +4 -X”_ 5 ), (2.2) 

where = h{u^ 1 , «”, u" ±1 , «^' ;±;2 ), and A — At/Ax. Here h, commonly called a numerical 

flux function, is required to be consistent with the conservation law in the following sense 

h{u,u,u,u) = f{u). (2.3) 


The numerical flux of the second-order explicit TVD scheme of Harten [3] can be expressed 
as 

= | P U + &+• ~ (2-<») 


where f, = /(«_,) + g j, a j+ i = a j+ i + 7 j+ i, and A j+ au = u J+ i - u., , with 
#0 = 1*1 


(2.4b) 


(/i+i - /y)/A i+ *« 

«K) 


Ay+itt ± 0 
A i+ *t. = 0 


(2.4c) 


gj = S • max [o, | A j + au|, 5 • 0j _ 1 A,-_ . tt)] (2.4d) 

5 = sgn(A,- + |«) 

and 


Here <r J+ 1 = ff(a J + i.) and can be expressed as 

a ( z ) = \W Z )- A ^ 2 ] > 0 (2- 4f ) 

for time accurate calculations. With this choice of < 7 ( 2 ), the scheme (2.2) together with (2.4) 
is second-order accurate in both time and space (except at points of extrema); see [3,4], Also, 
with this choice of the g function, the scheme will automatically switch itself to first-order at 
points of extrema. This is one of the vehicles to prevent spurious oscillations near a shock. 

Define a “minmod” function of a list of arguments to be the smallest number in absolute 
value if the list of arguments are of same sign, but to be zero if any argument is of the opposite 
sign. One can rewrite (2.4d) as gj = minmod(<7 J+ 1 A J+ ^u,aj_i Aj_iu). It is a form of the 
so called “limiter” for the control of unwanted oscillations in numerical schemes. Other more 
general forms for g can be obtained following the line of argument of Roe [5]. As a side remark, 
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J(0y+i-0a)/A y+ *u A,-. 

: \0 ' A,. 


u ^ 0 


(2.4e) 



if one sets all the g/s to zero in (2.4), the resulting scheme becomes Roe’s first-order upwind 
method. 


With the choice of the ip function in(2.4b), the corresponding first-order upwind scheme is not 
consistent with an entropy inequality, and the scheme might converge to a nonphysical solution. 
A slight modification of the numerical viscosity term 


~ { (z 2 + e 2 )/2e 


M>« 

M<« 


(2.5) 


can remedy the entropy violating problem [3,4], where c > 0 is a parameter. A formula for 
e can be found in reference [24]. The ip(z) in (2.5) is a continuously differentiable positive 
approximation to |z| in (2.4b). 

For simplicity of presentation, assume ip(z) has the form (2.4b). The use of an entropy 
satisfying ip(z) is only a minor modification of what is about to be presented. Furthermore, 
since the main concern of this note is the spatial discretization, only the forward Euler time 
differencing is considered. Other explicit, implicit, or predictor and corrector types of time 
differencing do not alter the results. 

Using the identity |a J+ i| = sgn (o J+ i)a J+ i and a J+ i — (fj+i - fj+i)/(u J+1 - u 3 ), Harten’s 
scheme can be rewritten as 


u ” +1 = u ? “ \ l 1 ~ s g n ( S ”+i)] U?+ 1 “ f?) - \ [* + sgn(a”_i)] (£ n - /7_ x ). (2.6) 

Here, the numerical flux function becomes 

h” + i = \[l~ sgn(S J+ r)] (/~ +1 - Ji) + Ji. (2.7) 

Note that with the choice of gj given in (2.4d), sgn(a” +i ) = sgn(a" +i ). Thus, one does not 
have to calculate 7 J+ i. The hf + i can also be expressed as 

fcf+i = \ [i - sgn(a J+ x)] (/ J+1 - Ji) + Ji- ( 2 - 8 ) 

With the use of the numerical flux (2.7) as an alternative, the g function of (2.4d) can be 
defined in a slightly different form. 

9 j = minmod (2.9a) 


with tr J+ i = c(a J+ i) and 

y (*) = § - (2.9b) 

Here the identities sgn(A J+ i/)sgn(o J+ i) = sgn(u J+1 - u y ) and |A J+ i/| = |o y+ i||u y+ i- u 3 | 
are used. The above limiter g 3 of (2.9) can be considered as a flux limiter since the flux / is 
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limited. For scalar problems, equations (2.4a), (2.7) and (2.8) are equivalent. The difference in 
operation count are very minor. Equations (2.4d,f) are preferred over (2.9) for its straightforward 
extension to system cases because u appears rather than /. Furthermore, in the extension to 
nonlinear systems via Huang’s and Roe’s generalizations, (2.4a) results in a smaller operation 
count than (2.7) or (2.8). 


III. van Leer Differencing 

In references [7,19,25] van Leer and his collaborators adopted the original MUSCL [1] type 
of differencing to obtain a second and third-order flux vector splitting method. Very impressive 
results were obtained for steady-state airfoil calculations. Their result indicated that one does 
not have to resort to a Godunov type of Riemann solver in order to obtain good shock resolution. 
In this paper the application of the MUSCL type differencing to Roe’s first-order scheme is 
proposed. 

Consider a three point explicit difference scheme in conservation form 


«;■» = «?-a (Aj +i -*?_,) 

= u” - A[A(uJ,uJ +1 ) - *(«?.„«?)] (3.1) 

where h" +A = /i(u",u” +1 ), is a first-order numerical flux. Here without a “ ~ ” is used 

to distinguish a first-order numerical flux from the higher-order numerical flux /»- + i. 

Roe’s first-order scheme (without entropy fixed) can be written as 

= 2 \fi f 3 ~ K+f l A y+f u ]’ (3-2) 


with o, + a defined in (2.4c). 

The MUSCL approach to obtain a spatially higher order differencing is to replace the argu¬ 
ments uy+i and Uj by it^ +i and A where 

<**+} = «<+i - j [<i - •) +(> + *)A+*«] P- 3 ) 

+ J [(1 “ + (l + tf ) A ,44 u ] * ( 3 ' 4 ) 

Here the order of accuracy in space is determined by the value of 6. For example, if 0 = — 1, 
the scheme is fully upwind and, if 6 = 0, it is the Fromm’s scheme. When 9 = 1/2, the scheme 
is third-order and, when 9 = 1, it is the regular three-point central difference scheme. 

Various “slope” limiters are use to eliminate unwanted oscillations. A popular one is the 
“minmod” limiter; it modifies the upwind-biased interpolation (3.3)-(3.4) as follows: 
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A J+ i« = minmod 

(3.7) 


A J+ iu = minmod (A j+ xu,pA j+ zuj 

(3.8) 

with 

minmod(z,/?y) = sgn(i) • max jo,min[|a:|,/?ysgn(x)] j 

(3.9) 

and 

» 3-0 

P ~l^0- 

(3.10) 

Therefore, a higher 
in equation (3.1); i. 

order scheme can be obtained by simply redefining the arguments of hj ± v 
e. 


-r = a [m- j b +s ,“'' +1 ) - 

(3.11) 


Applied to Roe’s scheme, the second-order numerical flux in (3.11) denoted by hY+ x is 


% = *(«£».<*) 

Anderson et al. [19] have demonstrated that the use of limiters is not necessary via the 
MUSCL differencing for some steady-state airfoil calculations. In reference [19], they use the 
fully one-sided unlimited second-order upwind scheme (0 = -1 in (3.4)-(3.5)) via the flux vector 
splitting of both van Leer [7] and Steger and Warming [26]. Very accurate steady-state numerical 
solutions were obtained for some typical airfoil calculations with shocks. 

For the fully upwind scheme with 0 = -1 and without limiter, and in (3.12) can 

be defined as 


|/<«? + t>-/(■&*> 






( u ?+i - u 5+0\ 


(3.12) 



the limited version 


(3.13) 

(3.14) 


(3.15) 

(3.16) 

For the unlimited case (3.13)-(3.14),less computation is required for (3.11) than Harten’s scheme 
(2.2). However, for the limited case (3.15)-(3.16) or (3.5)-(3.6) with 9 ^ —1, an extra limiter 
is required over the Harten scheme. This is due to the asymmetric nature of the scheme. In 
general (3.12) requires one more flux evaluation than Harten scheme requires. 

IV. Osher and Chakravarthy TVD Schemes 

Instead of using the MUSCL differencing, Osher and Chakravarthy [10] started with a one 
parameter family of semi-discrete schemes with numerical flux 


y-i/ + 

(4.1) 

where h J + i = h(uj,Uj+i) is the numerical flux of some first-order upwind TVD scheme. For 
comparison purposes, define as in (3.2). This five-point scheme again can be second or 

third-order depending on the value of the parameter 6. It is defined the same way as before. 
The superscript + or — denotes the flux difference across the wave with positive or negative 
wave speed. To obtain a TVD scheme, Osher and Chakravarthy used the flux limiter concept 
by simply modifying the last four terms on right hand side of (4.1); i.e., their numerical flux 
denoted by h9+ L is 


i (I-*) (, (1 + 0) ( 

Vi—i (Vi 7 ) — r~v 


( 1 + 0 ) / 


, (i-0) / 


R 1 * 

Uy+1 = «i+1 - 2 V|« 

“?+* = U J+1 - ^ V§“ 



The “ ~ ” shown over the flux difference values denotes the flux limited value and they are 
computed as follows 


A J+ |/-= minmod [A i+ |/ ,/3 A j+ i/ ] 


(4.3a) 
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(4.3b) 


A j+ if~ = minmod [Ay + i/ _ ,/9A J+ |/ _ ] 

A J+ i/+ = minmod [Ay + i/ + ,/?Ay_i/ + ] (4.3c) 

Ay_i/+ = minmod [Ay_i/ + ,/?A J+ i/ + j (4.3d) 

where the operator “minmod” is the same as (3.9) and is defined almost the same way as in 

(2.4d). The value ft is the same as in (3.10). Due to the design principle of this scheme, the 

numerical flux (4.2) requires more computation than (2.4), (2.8) or (3.12). For 9 = -1, (4.2) 
together with (4.3) requires one more limiter than Harten’s method. For 0^—1, two more 
limiters than van Leer’s method and three more limiters than Harten’s method are required for 
(4.2)-(4.3). The extra computations will become even more apparent as we extend these schemes 
to system cases. 


V. Generalization to Hyperbolic System of Conservation Laws 

The procedure considered here for generalization to system cases is to extend the scalar scheme 
to system cases so that the resulting scheme is TVD for the “locally frozen” constant coefficient 
system. To do that, we define at each point a “local” system of characteristic fields, and then 
apply the scheme to each of the m scalar characteristic equations. Here m is the dimension of the 
hyperbolic system. The extension technique is a somewhat generalized version of the procedure 
suggested by Huang [11] and Roe [12]. For simplicity of presentation, only conservation laws in 
Cartesian coordinates with equal spatial step size will be discussed. 

Consider a nonlinear hyperbolic system of conservation laws 


3U dF(U) 
dt + dx 


A{U) = 


dF{U) 

dU 


(5.1) 


Here U and F(U) are column vectors of m components. Let the eigenvalues of A be 
(a 1 , a 2 , ...,a m ). Denote R as the matrix whose columns are eigenvectors of A, and Jfi -1 as 
the inverse of R. Define the characteristic variable of W with respect to the state U by 


W — R~ X U. (5.2) 

In the constant coefficient case (5.1) decouples into m scalar equations for the characteristic 
variables 


dw l ,dw l 
dt dx 


0 , 


(5.3) 


with ^constant and w l the elements of W. This offers a natural way of extending a scalar 
scheme to a constant coefficient system by applying it to each of the m scalar characteristic 
equations (5.3). 



Let Uj + 1 denote some symmetric average of U 3 and Z7y+i such as the arithmetic mean average 
of Huang or the Roe’s average [11,12] for gas dynamics. Let a'. + i , Rj + i, RJ+x denote the 
quantities a 1 , R, f? -1 evaluated at U J + x. Let w l be the vector elements of W and let a*. + A = 
w j+i ~ be the component of A J+ il7 = Uj+i - Uj for the /th characteristic; i.e., 

“j+4 = - R i+V A »'+* £ ') ( 5 - 4 “) 

and 

A i+i U = R i+h a i+i- ( 5 - 4b ) 

With the above notation, one can apply the scalar scheme to each of the locally defined 
characteristic variables of (5.3). Transforming back to the original variable locally, one gets 

= cr;- a(f; + *( 5 . 5 ) 


Numerical Flux for Harten Scheme 

The corresponding numerical flux of (2.4) can be written as 

ff" J = \ [*V + + « i+4 * )+ i], (5 6a) 

where the elements of the $ J+A denoted by ^*. + A ,/ = l,...,m are 

= 9\ + 9j+i ~ 1 + Tf} +i )aJ +i (5.6b) 

9j = minmod (cr*. + | a'. + A , cr'_ A a'_ A ) (5.6c) 


with ip(z) defined in (2.4b); c 


i is (2.4f) with z = a*. + A , and 

«+1 “ 






where a*. + A are the elements of (5.4a). 


(5.6d) 


Extension of the scalar TVD scheme to nonlinear system cases is not unique. Take, for 
example, the case where the numerical flux /i J+A in (2.4) is identical to hj + x in (2.8). The 
corresponding numerical fluxes for the system case have a different form depending on which of 
the scalars hj + i one started with. If one started with (2.8), the system can be expressed as 

or= u ? -1 (j- *u) w+i -m-\ r - ?-.) m 


where 



A j+k = R jit k i+l R i+l 

(5.7b) 

Aj+5 = dia S [*»(•}+*)] , 

and one way to define 7j is 

(5.7c) 

7j = Fj + RjGj 

(5.7d) 

with Gj = (<;*, ...,g™) T and diag(e*) denoting a diagonal matrix with diagonal elements z l . The 
numerical flux Hj ± i can be of the form 

B i+i = l(i - A l+i )p j+t - rj + T, 

(5.8a) 


(5.8b) 

Eventhough Rj + x is needed in both (5.6) and (5.8), scheme (5.6) is preferred 
Rj is not required in (5.6). In other words, for scalar cases (2.4) and (2.8) are 
system cases (5.8a) requires more work than (5.6a). 

over (5.8) since 
identical, but in 

Numerical Flux for van Leer Differencing 


The numerical flux in this case is 


= \ +who - *»■+*]. 

(5.9a) 


where i? J + A is the eigenvector of A evaluated at some symmetric average of U? + A and U^ + A ; 
i.e., 


j*» 

+' 

II 

* 

+ *• 

(5.9b) 

and the elements of ^ J+ i are 


Pj+i = l a !+il S i+i 

(5.9c) 

where again a*. +A and 5*. +A are evaluated at some symmetric average of A 

and l /f +A and 

a l j+ x = a\uf +k ,Uf H ) 

(5.9d) 


(5.9e) 


with 
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v*+ i = u »i - J [(1 - + l 1 + *)A i+ itr] (5.9f) 

V? + j = Pj + J [(1 - + (1 + *)A^p] (5.9g) 

where each element of the quantities are defined in the same way as in (3.7) and (3.8). 
Numerical Flux for Osher and Chakravarthy Schemes 
Their numerical flux for a system can be written as 


H {1 ~ 0 h 

H j+$ ~ 4 h 


§ ^- +f _ 


(5.10a) 


Here the Hj+i is the first-order numerical flux 


(5.10b) 


where the elements of $ J+ i are = | a y + il a y + i 
The elements of the A *’s are 


A i+i ~ °!f+i a i+i 

(5.10c) 

” °i+i a .j+i 

(5.10d) 

|), and a*. + JL is (5.4), and 


•^+§ = minmod [Aj+i’Wj+tl 

(5.10e) 

= minmod a] 

(5.10f) 

A%l = minmod \A l + + ^pA l +_J\ 

(5.10g) 

A l jti = minmod • 

(5!0h) 


Collecting terms of i? J+ l , (5.10a) can be rewritten as 
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(5.11) 


fjOC _1 J 


n+*V + . - « J+ i [«, + J + 


.(Li*), 


| + ~4~ R i-h A i- 


The terms that are common to the numerical fluxes of Harten’s Hj* + A (5.6), van Leer’s HY+\ 
(5.9) and Osher and Chakravarthy’s (5.11) are Fj+ 1 , Fj, i? J + 1 and cij+i- Both and 

H VL require one matrix-vector multiplication R J+ 1 4> J+ a or R j+ a$ j+ i. Comparing (5.6b)- 
(5.6d) and (5.9c)-(5.9g), the computational effort between H h and H VL is very competitive. 
The first three terms on the right hand side of in (5.11) require similar operation count 

as H h and H VL . The last two terms of (5.11) consist of approximately 2/5 of the computation 
of the numerical flux H°+i- They are the extra computation that are not present in H h and 
H vl . Therefore, for 6 ^ — 1, H oc is approximately 40-45% more expensive to use than H h or 
H VL . For 6 = -1, H oc is approximately 20-25% more expensive than the other two schemes. 


Concluding Remarks 

The current note addresses an aspect of implementating three existing spatially five-point 
upwind schemes for systems of hyperbolic conservation laws. The reason for choosing the above 
methods is that they represent typical design principles often used in constructing second-order 
(or higher-order) upwind TVD schemes. The current conclusion on the relative computational 
effort among the three methods mainly addresses unsteady calculations. 

It is observed that the five-point upwind TVD scheme of Osher and Chakravarthy requires 
more operations than Harten or van Leer. The reason lies mainly in the different way of arriving 
the spatially second and third-order TVD schemes. The indication is that one can achieve the 
same spatial order of accuracy as Osher and Chakravarthy by using the lower operation count 
method of van Leer or Harten. Moreover, for the forward Euler time differencing, only Harten’s 
method is second-order in time and space; the other two are first-order in time but second-order 
in space. To achieve second-order in time as well as space, additional work is required for the 
van Leer and for the Osher and Chakravarthy methods. 

For steady-state applications, aside from accuracy requirements, convergence rate of a scheme 
is equally or more important than the CPU time per time step. The performance of an individual 
scheme in terms of convergence rate is highly problem dependent. Therefore the discussion 
here requires extensive numerical testing before a clearer picture can be drawn for steady-state 
applications. 
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